ARITHMETIC STRUCTURE IN SPARSE DIFFERENCE SETS 



MARIAH HAMEL NEIL LYALL KATHERINE THOMPSON NATHAN WALTERS 



Abstract. Using a slight modification of an argument of Croot, Ruzsa and Schoen we establish a quantita- 
tive result on the existence of a dilated copy of any given configuration of integer points in sparse diff'erence 
sets. More precisely, given any configuration {vi , . . . ,vi} of vectors in Z"*, we show that if A C [1, A'^]'' with 
\A\/N'^ > CN~^/^ , then there necessarily exists r ^ such that {rv\, . . . ,rvi} Q A — A. 



1. Introduction 

Many familiar theorems in mathematics have as a common feature the phenomenon that the set of 
differences from a sufficiently large set contains non-trivial structure. In this paper we study instances of 
this phenomenon in the finite setting of subsets of {1, 2, . . . , A^}'^. In this setting we are able obtain to 
quantitative structure theorems by adapting an argument of Croot, Ruzsa and Schoen 

1.1. Arithmetic progressions in sumsets. A good measure of the amount of additive structure in a given 
finite set of integers is provided by the size of the longest arithmetic progression that the set contains. 

In [3] Croot, Ruzsa and Schoen establish, using only simple combinatorial arguments, the following struc- 
tural result along these very lines for the sumset 

(1) A + B -.^ {a + b : a & A,h & B} 

of two given sparse sets A,BC [1, A^] = {1, 2, . . . , N}. 

Theorem A (Croot, Ruzsa and Schoen [3 ). Let A,BC [1,N] and m > 3 be an odd positive integer. 
If |yl||_B|/iV'^ > QN~'^/^"^^^\ then A + B contains an arithmetic progression of length at least m. 

We remark that if |A||i3|/iV^ > (logiV)^^^^, for any e > 0, then the conclusion of Theorem A can in fact 
be strengthened significantly. Using Fourier analytic techniques Green [4j, improving on previous work of 
Bourgain [2 , proved the following 

Theorem B (Green [4]). Let A,BC [l,iV], then there exist absolute constants < c < 1 < C such that if 

\A\\B\ ^ (loglogiV)^ 
7V2 - log N 

1 /2 

then A + B contains an arithmetic progression of length at least exp^c ^ ^^jjf^ logiV^ 

Ruzsa [TU] has shown that for any e > and N sufficiently large, there exists a set A C [1,N] with 
1^1/-^ > 1/2 — e whose sumset A + A does not contain an arithmetic progression of length exp((log iV)^^^^^) . 

In this paper we give a slight simplification of the proof of Theorem A that appears in 3^ and show how 
this can then be easily adapted to prove the natural higher dimensional generalization of Theorem A (as 
well as certain polynomial variants). 
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1.2. Multi-dimensional configurations in sparse diflference sets. The main result of this paper is the 
following quantitative multi-dimensional Szemeredi theorem for sparse difference sets. 

Theorem 1. Given any collection vi, . . . ,V£ of vectors in ifi there exists a constant Cg^d = Ce^di^i, ■ ■ ■ , vg) 
such that the difference set of any set AC [1, A'^]'' with 

will he guaranteed to contain some dilate of the original configuration {yi, . . . ,vg}, that is to say that there 
will necessarily exist an integer r > for which 

{rvi , . . . , rvg} A ~ A. 

Remark 1. Since A ~ A is symmetric it will also contain the reflection in the origin of this configuration, 
namely the configuration {— rwi, . . . , —rvg}. 

A simple averaging argument (that we will make use of again and again) allows us to deduce, from 
Theorem [U the following structural result for the sumset A + B oi two given sets A,B C [1, A^]''. 

Corollary 1. Let v^, . . . ,vg e Z'^ and A,B C [1, N]'^. If \A\\B\/N^'^ > 2'^Cg^dN-^/^ , then there exists r^O 
and t e [2,2NY such that 

t±r ■ {vi, . . . ,vg} C A + B. 



Proof. Since 

^ \Bn{t-A)\^\A\\B\ 

te[2,2N]'' 

it follows that there exists t £ [2, 2N]'^ such that if we set D = B D {t — A), then 

The result therefore follows immediately from Theorem [1] since D — D + tCA + B. □ 

Remark 2 (On the constant Cg^d in Theorem [T] and Corollary [1]) . Given any collection vi, . . . ,vg of vectors 
in Z'* let s denote the size of the largest projection of these given vectors onto the coordinate axes. In other 
words we define 

(2) s = s{vi, ... ,vg) = max \\vj\\oo = max{ \{vj,ei)\ : l<j<£, l<i<d}, 

i<j<i 

where {ei}i<i<:d denotes the standard basis vectors for Z^. 

It is clear that in order to obtain a non-trivial conclusion in Theorem [1] and Corollary [1] we must have 
r G [1, N/s], and in particular TV > s (a fact that will be forced on us by the choice of constant Cg^d)- 

In the proof of Theorem [Jl which we present in Section |4l we will see that we can in fact take 

(3) Cg,d := (2.n ri(l + \^^))"' < 2^(2.) 

It follows that we are able to recover Theorem A, the structural result for sparse sumsets of Croot, 
Ruzsa and Schoen, with only marginally weaker bounds (the constant 6 replaced with 8) as a special case 
of Corollary [TJ namely the special case d = I with Vj = j. 

Finally, it is also easy to see that for specific (and interesting) choices of vectors vi, . . . ,V£ G Z'^ the true 
value of Cg^d is significantly smaller than the trivial upper bound of 2''(2s)^/^. For example, in the case when 
i ^ d and Vj — ej (a d-dimensional corner), we have Cg^d ^ 2^~^^^^. 
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1.3. Outline of the paper. In Section [2] we present a proof of Tlieorem [T] in the special case when d = 1 
with Vj = j. As a corollary of this result (and some known quantitative results on polynomial patterns in 
difference sets) we give quantitative bounds on the size of a set A C [1, N] that will ensure that its difference 
set contains a long arithmetic progression whose common difference is a perfect square. 

In Section |3] we formulate a polynomial generalization of Theorem [1] namely Theorem [SJ and present a 
proof of this result in a special case (Theorem ^ . 

Finally, in Section U we prove Theorem [T] and sketch the proof of Theorem [S] 



2. Arithmetic progressions in sparse difference sets 



2.1. A special case of Theorem [Jl Although the following result can be extracted from Croot, Ruzsa 
and Schoen |3] we feel that the argument below is perhaps slightly simpler and easier to generalize. 

Theorem 2 (A special case of Theorem [T]). Let AC [1,N] and m > 3 be an odd positive integer. 

If \A\/N > 4A^^^/(™~"'^', then A — A contains an arithmetic progression of length at least m. 

Proof of Theorem\M Let m ^ 2£ + I. For each w = (wi, . . . , wi) £ ll wc defincQ 

n^ = {r£ [1, : jr + u;, G A (1 < j < £)} 
and note that if, for some w G Z^, there exist r',r" G 7?,^ with r' ^ r", then it follows immediately that 

j{r' ~ r") cA-A 

for each \ < j < I and hence, utilizing the fact that A — A\s symmetric, that the difference set A — A 
contains an arithmetic progression of length 2£ + 1. 

It therefore suffices to establish the existence of a w G such that \R,.u,\ > 2. In order to do this we 
(naturally) restrict our attention to those w for which TZ^ has at least a chance of being non-empty, namely 

W = {w G Z^ : 1 - jNjl < < iV - 1 (1 < j < ^)}, 

and note that 

|W| < iV^ J|(l + ill) < 2^N^. 

Since the average 

' ' uieW ' ' w£W r=l j=l I I 

it follows that there must exist a w G W such that 

'n'J £2^ 

and consequently, for this choice of w, that the set TZ^u will contain at least 2 elements provided 



N - 'm/<^ 

where Ce — 2(2^)^/^. It is an easy (calculus) exercise to finally show that 2 < Ci < 4. □ 

It is clear that the arguments presented above are fiexible enough to be applied almost verbatim to more 
general situations and at this point the reader is encouraged to prove Theorem [T] for herself (the details can 
be found in Section U). 

We now turn our attentions to the problem of finding polynomial configurations in difference sets. 



^ Of course N/£ need not be an integer, however here, and in the remainder of this article, we will make the slight (but 
convenient) abuse of notation of identifying [l,x] with [1, [x]] for any given positive real number x. 
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2.2. Some remarks on polynomial configurations in difference sets. The following result, a quan- 
titative polynomial Szemeredi theorem for difference sets, can be established by a careful application of 
standard Fourier analytic (circle method) techniques, see and 0- 

Theorem C (Lyall and Magyar |7j). Let Pi, . . . ,Pi e Z[r] with each Pj{0) — and k = maxj degPj > 2. 
There exists an absolute constant C = C(Pi, . . . , Pi) such that for any AC [1, A^] with 

\A\ ^ f jloglogNr ^'/"-'-'^ 
N - V ^ogN 
there necessarily exist r ^ such that 

{Piir),...,Pe{r)}CA-A. 

In the case of a single polynomial {£ = 1), this result was also obtained by Lucier [5] and, to the best of 
our knowledge, constitutes the best bounds that are currently known for arbitrary polynomials with integer 
coefficients and zero constant term. However, using some rather involved Fourier arguments, Pintz, Steiger 
and Szemeredi in j8] were able to establish the following impressive result for square differences. 

Theorem D (Pintz, Steiger and Szemeredi [8 ). Let AC [1, A^] and m>Z he a positive integer. If 

1,1 / , \ c log log log log N 



N - \\ogN 
then there necessarily exist r ^ Q such that r^ E A — A. 

We note that it is conjectured that for any e > and N sufficiently large there exists a set AC [1,N] 
with \A\ > N^~'^ that contains no square differences. Ruzsa [9 has demonstrated this for e = 0.267. 

Arguing as in the proof of Theorem[2]we can deduce, from Theorem D, the following result pertaining to 
arithmetic progressions with square differences. 

Corollary 2. Let A C [1, A^] and m > 3 be a positive integer. If 

I /I I /I \ 2c' loglogloglogAf/(m-l) 

N - \logN J 

then A — A contains an arithmetic progression of length at least m with common difference r^ (with r 

Proof. Let m = 2£ + 1. For each w = (wi, . . . , wi) G if' we again define 

Tln^ - {re [1, N/l] : jr + Wj E A {1 < j < £)}. 

It follows from Theorem D that if, for some w € we have 

ID I / I \ clog log log log(Ar/£) 



^ ' N/£ - \logN/£^ 

then there will necessarily exist r' , r" G TZ^ and r ^ such that r' — r" = r^ and consequently also that 

jr^ e A- A 

for each 1 < j < £■ Utilizing the fact that A ~ A is symmetric, it then follows that the difference set A — A 
contains an arithmetic progression of length 2£ + I with a square common difference. 

It therefore suffices to establish a condition on the set A guaranteeing that estimate (|4|) will hold. We 
saw, in the proof of Theorem [5] above, that there exists w = {wi, . . . , wg) G such that 

\RJ ^ f\A\V 1 



N/£ - \ N J 2^ 



ARITHMETIC STRUCTURE IN SPARSE DIFFERENCE SETS 



5 



Hence, inequality Q will hold provided 

Ml /I X clog log log log(Ar/£)/£ 

1^1 > 2C^I 



N - \logN/e, 

and the result follows. □ 

The methods of Pintz, Steiger and Szemeredi were later extended by Balog, Pelikan, Pintz and Szemeredi 
[T to obtain Theorem D (with the same bounds, but with the constant C now depending on k), and hence 
also Corollary [21 for fcth power differences. However, as we mentioned above, these impressive bounds have 
yet to be established for arbitrary polynomials with integer coefficients and zero constant term. 

3. A POLYNOMIAL GENERALIZATION OF THEOREM [T] 

3.1. A special case of Theorems |4] and [51 Before stating our polynomial generalization of Theorem [1] 
namely Theorem [5l (which contains as a special case a weak variant of Theorem C for sparse difference sets), 
we present the following very special case. It is our hope that working through this special case will help 
motivate the ultimate formulation of Theorem [SI 

Theorem 3 (A special case of Theorems [H and [SI . Let P{r) — akr^ + • • • + air + oq with each aj G Z and 
Uk > 0. If N is sufficiently large and A C [1, A^] with 

\A\/N > {4al^'')N~^^'' 
then there exist r', r" G N with r' ^ r" such that 

P{r') - P{r") eA-A. 

Note that a conclusion of the form P{r) G A— A for such sparse sets is forbidden, see the remark proceeding 
Theorem D concerning partial progress towards a conjecture of Ruzsa. 

Proof. It suffices to establish the existence of a w G Z such that \Tlw \ > 2 where 

7^„, - {r G [1, Ao] : P{r) + w A} 
with Aq = {N / OkY^^ . We define W to be the smallest collection of w G Z for which 

P(r) + W D [1, A] 

for all r G [1, Ao]. If A is sufficiently large (depending on the coefficients of P) it follows that 

max \P{r)\ < P(Ao) < 2 A 

r-e[l,Afo] 

and hence that 

W C {w G Z : 1 - P(Ao) < w < P(Ao) - 1} 
from which we can conclude that 

|W| < 4A. 

Remark 3. If P(r) > on [1, Aq], then we could conclude, as in the proof of Theorem [51 that in fact 
|yV| < 2 A (provided that A is large enough). In the further special case when P(r) = Ofcr*^ we note that 
one can drop the "let A be sufficiently large" assumption in the statement of the theorem. 

Since the average 

1 V- , , 1 V- / . X X l^lAo 

' ' weW ' ' wGWr=l 

for A sufficiently large, it follows that if 

\A\/N > {'ial^'')N-^/^ 

and A is sufficiently large, then there will necessarily exist w eW for which \TZ.u]\ > 2. □ 
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3.2. A polynomial variant of Theorem [TJ Let {vi, . . . ,vi} be a fixed configuration of vectors in Z''. 
Given polynomials Pi,...,Pi £ Z[r] with maxi<j<£ degPj < fc, our methods allow us to establish the 
following non-isotropic generalization of Theorem [1] 

Theorem 4. If A [1,NY with \ A\/N'^ > CN~^/^^, for some constant C, then there exist r' ,r" e N with 
r' ^ r" such that 

{(Pi(r') - Pi(r")) «i, . . . , (Piir') - P,{t")) v,}^A- A. 
Since, for each 1 < j < ^, we can write 

d 
i=l 

where Qij{r) :— Pj{r){vj, Ci) we see that Theorem 0] is a special case of the following more general result. 

Theorem 5. Let Q — {Qij}i<i<d be a fixed collection of id polynomials in Z[r] and k — maxg^gg degQij, 

i<j<e 

then there exists a constant Ci^d,k — Cg,d,k{Q) such that if N is sufficiently large and A C [1, A^]"* with 

\A\IN'' > C^.d,kN~^"'' 

then there exists r', r" G N with r' ^ r" such that 

d 

Y.^Q^J{r') - Qrjir")) ^A-A. 
1=1 

Remark 4 (on the constant Ci^d,k in Theorem [S|). If we now let t denote the absolute value of the largest 
leading coefficient of the polynomials that have the largest degree, namely 

(5) t^ max lim \Qti{r)\/r^ 

then, as we shall see in the proof below, we will be able take 

Ct,dM=^\2t^'''f" 

in general and replace the 4 with 2 in the special case where the Qij take only positive values on N. 

Note that we exactly recover Theorem [1] (with the same constant) as a special case of Theorem [Sj namely 
the special case where Qij{r) = r{vj,ei), since in this case we can drop the "let N be sufficiently large" 
assumption (see Remark [3] in the proof of Theorem , we leave this observation for the reader to verify. 

4. Proof of Theorem [T] and Theorem [S] 

4.1. Proof of Theorem [H Let s be defined by formula For each w = {wi, . . . , wg) £ {'Z'^Y we define 

TZu, = {r e [1, N/s] : rvj + Wj e A {1 < j < £)} 
and W to be the smallest collection oi w = {wi, . . . , Wi) e {Z'^Y for which 

for aU 1 < j < £ and r e [1, N/s]. 

As in the proof of Theorem[2]it will suffice to establish the existence of a w £ W such that \TZuj\ > 2. The 
fact that this happens whenever 

(!#)^|nn(i^^) 

^ ^ i=ij=i 
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follows immediately from the observation that the average 

1 ^ N/s i IN 

^ E i^^i = ^ E E n + = ^1^1 V 

and the (easily verified) fact that 



4.2. Proof of Theorem [5l Let i be defined by formula ([5]). For each w — (wi, . . . , wi) £ {Z'^Y we define 

d 

= {r e [1, iVo] : + ^ (r)e, G A (1 < j < ^)}, 

with TVo = {N/ty/'^. Since, for TV sufficiently large (depending on the coefficients of Qij), 

max |Q,,(r)|<|Q,,(7Vo)|<27V 

for all 1 < j < £, we will restrict our attention to those w that are contained in the set 

W = {we [Z'^Y : 1(^^,6^)1 < 27V- 1 (1 < j < ^, 1 < i < d)}, 
and note that |W| < {ANY"^ provided that iV is sufficiently large. Since the average 

Wi ^ I'^-i^M ^ Erii-(».+Eo«M».) > (!^)'$ 

I ' wew ' ' weW r=l j=l i=l ^ ^ 

for TV sufficiently large, it follows that there will necessarily exist e W for which \R.w\ > 2 provided that 

\A\IN'^ > (2i/'?4'iii/tt)jv-i/tt 

and TV is sufficiently large. 

References 

[1] A. Balog, .J. Pelikan, J. PiNTZ, E. SZEMEREDI, Difference sets without K-th powers, Acta Math. Hungar. 65 (1994), 
165-187. 

[2] J. BOURGAIN, On Arithmetic Progressions in Sums of Sets of Integers, A tribute to Paul Erdos, 105-109, Cambridge 

University Press, Cambridge, 1990. 
[3] E. CrOOT, I. RuzSA, T. SCHOEN, Long arithmetic progressions in sparse sumsets. Integers: The Electronic Journal of 

Combinatorial Number Theory, 7(2) (2007), #A10. 2005. 
[4] B. Green, Arithmetic Progressions in Sumsets, Geom. Funct. Anal. (2002), 584-597. 
[5] J. LuClER, Intersective sets given by a polynomial. Acta Arith. 123 (2006), no. 1, 57-95. 

[6] N. Lyall and a. Magyar, Polynomial configurations in difference sets, J. Num. Theory, v. 129/2, pp. 439-450, 2009. 

[7] N. Lyall and A. Magyar, Polynomial configurations in difference sets (revised version), axxiv.org/abs/0903.4504. 

[8] J. Pintz, W. L. Steiger, E. Szemeredi, On sets of natural numbers whose difference set contains no squares, J. London 

Math. Soc. 37 (1988), 219-231. 
[9] I. Z. RuzSA, Difference sets without squares. Period. Math. Hungar. 15 (1984), 205-209. 
[10] I. Z. RuzSA, Arithmetic Progressions in Sumsets, Acta. Arith. (1991), no. 2, 191-202. 

Department of Mathematics, The University of Georgia, Athens, GA 30602, USA 

E-mail address: mhajnel9inath.uga.edu, lyall9inath.uga.edu, thoinpson9inath.uga.edu, nwalters9inath.uga.edu 



